Learning curves for drug response prediction in cancer cell lines
نویسندگان
چکیده
Abstract Background Motivated by the size and availability of cell line drug sensitivity data, researchers have been developing machine learning (ML) models for predicting response to advance cancer treatment. As studies continue generating a common question is whether generalization performance existing prediction can be further improved with more training data. Methods We utilize empirical curves evaluating comparing data scaling properties two neural networks (NNs) gradient boosting decision tree (GBDT) trained on four screening datasets. The are accurately fitted power law model, providing framework assessing behavior these models. Results demonstrate that no single model dominates in terms across all datasets sizes, thus suggesting actual shape depends unique pair an ML dataset. multi-input NN (mNN), which gene expressions cells molecular descriptors input into separate subnetworks, outperforms single-input (sNN), where features concatenated layer. In contrast, GBDT hyperparameter tuning exhibits superior as compared both NNs at lower range set sizes tested datasets, whereas mNN consistently performs better higher sizes. Moreover, trajectory suggests increasing sample expected improve scores NNs. These observations benefit using evaluate models, broader perspective overall characteristics. Conclusions A curve provides forward-looking metric analyzing serve co-design tool guide experimental biologists computational scientists design future experiments prospective research studies.
منابع مشابه
Efficient parameterization of large-scale mechanistic models enables drug response prediction for cancer cell lines
Institute of Computational Biology, Helmholtz Zentrum München, 85764 Neuherberg, Germany Chair of Mathematical Modeling of Biological Systems, Center for Mathematics, Technische Universität München, 85748 Garching, Germany Alacris Theranostics GmbH, 12489 Berlin, Germany Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany Dahlem Centre for Genome Research and Medical Systems Biol...
متن کاملCelecoxib Up Regulates the Expression of Drug Efflux Transporter ABCG2 in Breast Cancer Cell Lines
Elevated expression of the drug efflux transporter ABCG2 seems to correlate with multidrug resistance of cancer cells. Specific COX-2 inhibitor celecoxib has been shown to enhance the sensitivity of cancer cells to anticancer drugs. To clarify whether ABCG2 inhibition is involved in the sensitizing effect of celecoxib, we investigated whether the expression of ABCG2 in breast cancer cell lines ...
متن کاملCelecoxib Up Regulates the Expression of Drug Efflux Transporter ABCG2 in Breast Cancer Cell Lines
Elevated expression of the drug efflux transporter ABCG2 seems to correlate with multidrug resistance of cancer cells. Specific COX-2 inhibitor celecoxib has been shown to enhance the sensitivity of cancer cells to anticancer drugs. To clarify whether ABCG2 inhibition is involved in the sensitizing effect of celecoxib, we investigated whether the expression of ABCG2 in breast cancer cell lines ...
متن کاملCancer cell lines for drug discovery and development.
Despite the millions of dollars spent on target validation and drug optimization in preclinical models, most therapies still fail in phase III clinical trials. Our current model systems, or the way we interpret data from them, clearly do not have sufficient clinical predictive power. Current opinion suggests that this is because the cell lines and xenografts that are commonly used are inadequat...
متن کاملImportance of collection in gene set enrichment analysis of drug response in cancer cell lines
Gene set enrichment analysis (GSEA) associates gene sets and phenotypes, its use is predicated on the choice of a pre-defined collection of sets. The defacto standard implementation of GSEA provides seven collections yet there are no guidelines for the choice of collections and the impact of such choice, if any, is unknown. Here we compare each of the standard gene set collections in the contex...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: BMC Bioinformatics
سال: 2021
ISSN: ['1471-2105']
DOI: https://doi.org/10.1186/s12859-021-04163-y